106 research outputs found
Hot Streaks on Social Media
Measuring the impact and success of human performance is common in various
disciplines, including art, science, and sports. Quantifying impact also plays
a key role on social media, where impact is usually defined as the reach of a
user's content as captured by metrics such as the number of views, likes,
retweets, or shares. In this paper, we study entire careers of Twitter users to
understand properties of impact. We show that user impact tends to have certain
characteristics: First, impact is clustered in time, such that the most
impactful tweets of a user appear close to each other. Second, users commonly
have 'hot streaks' of impact, i.e., extended periods of high-impact tweets.
Third, impact tends to gradually build up before, and fall off after, a user's
most impactful tweet. We attempt to explain these characteristics using various
properties measured on social media, including the user's network, content,
activity, and experience, and find that changes in impact are associated with
significant changes in these properties. Our findings open interesting avenues
for future research on virality and influence on social media.Comment: Accepted as a full paper at ICWSM 2019. Please cite the ICWSM versio
Co-Following on Twitter
We present an in-depth study of co-following on Twitter based on the
observation that two Twitter users whose followers have similar friends are
also similar, even though they might not share any direct links or a single
mutual follower. We show how this observation contributes to (i) a better
understanding of language-agnostic user classification on Twitter, (ii)
eliciting opportunities for Computational Social Science, and (iii) improving
online marketing by identifying cross-selling opportunities.
We start with a machine learning problem of predicting a user's preference
among two alternative choices of Twitter friends. We show that co-following
information provides strong signals for diverse classification tasks and that
these signals persist even when (i) the most discriminative features are
removed and (ii) only relatively "sparse" users with fewer than 152 but more
than 43 Twitter friends are considered.
Going beyond mere classification performance optimization, we present
applications of our methodology to Computational Social Science. Here we
confirm stereotypes such as that the country singer Kenny Chesney
(@kennychesney) is more popular among @GOP followers, whereas Lady Gaga
(@ladygaga) enjoys more support from @TheDemocrats followers.
In the domain of marketing we give evidence that celebrity endorsement is
reflected in co-following and we demonstrate how our methodology can be used to
reveal the audience similarities between Apple and Puma and, less obviously,
between Nike and Coca-Cola. Concerning a user's popularity we find a
statistically significant connection between having a more "average"
followership and having more followers than direct rivals. Interestingly, a
\emph{larger} audience also seems to be linked to a \emph{less diverse}
audience in terms of their co-following.Comment: full version of a short paper at Hypertext 201
A Motif-based Approach for Identifying Controversy
Among the topics discussed in Social Media, some lead to controversy. A
number of recent studies have focused on the problem of identifying controversy
in social media mostly based on the analysis of textual content or rely on
global network structure. Such approaches have strong limitations due to the
difficulty of understanding natural language, and of investigating the global
network structure. In this work we show that it is possible to detect
controversy in social media by exploiting network motifs, i.e., local patterns
of user interaction. The proposed approach allows for a language-independent
and fine- grained and efficient-to-compute analysis of user discussions and
their evolution over time. The supervised model exploiting motif patterns can
achieve 85% accuracy, with an improvement of 7% compared to baseline
structural, propagation-based and temporal network features
Professional Gender Gaps Across US Cities
Gender imbalances in work environments have been a long-standing concern.
Identifying the existence of such imbalances is key to designing policies to
help overcome them. In this work, we study gender trends in employment across
various dimensions in the United States. This is done by analyzing anonymous,
aggregate statistics that were extracted from LinkedIn's advertising platform.
The data contain the number of male and female LinkedIn users with respect to
(i) location, (ii) age, (iii) industry and (iv) certain skills. We studied
which of these categories correlate the most with high relative male or female
presence on LinkedIn. In addition to examining the summary statistics of the
LinkedIn data, we model the gender balance as a function of the different
employee features using linear regression. Our results suggest that the gender
gap varies across all feature types, but the differences are most profound
among industries and skills. A high correlation between gender ratios of people
in our LinkedIn data set and data provided by the US Bureau of Labor Statistics
serves as external validation for our results.Comment: Accepted at a poster at ICWSM 2018. Please cite the ICWSM versio
The Effect of Collective Attention on Controversial Debates on Social Media
We study the evolution of long-lived controversial debates as manifested on
Twitter from 2011 to 2016. Specifically, we explore how the structure of
interactions and content of discussion varies with the level of collective
attention, as evidenced by the number of users discussing a topic. Spikes in
the volume of users typically correspond to external events that increase the
public attention on the topic -- as, for instance, discussions about `gun
control' often erupt after a mass shooting.
This work is the first to study the dynamic evolution of polarized online
debates at such scale. By employing a wide array of network and content
analysis measures, we find consistent evidence that increased collective
attention is associated with increased network polarization and network
concentration within each side of the debate; and overall more uniform lexicon
usage across all users.Comment: accepted at ACM WebScience 201
- …